RECOMBINANT bHLH-PAS/JHR POLYPEPTIDE AND ITS USE
TO SCREEN POTENTIAL INSECTICIDES



BACKGROUND OF THE INVENTION

The present invention relates to a cloned "basic
helix-loop-helix-PER-ARNT-AhR-SIM" (bHLH-PAS) protein
that is a juvenile hormone receptor (JHR), bHLH-PAS/JHR.
In particular, this invention is directed to a bHLH-PAS/JHR
gene isolated from Drosophila, termed the
methoprene-tolerant (met) gene (Met-JHR). The present
invention is also directed to in vitro and in vivo
methods for screening insecticides using recombinant
bHLH-PAS/JHRs. The present invention is further directed
to methods for isolating polynucleotides encoding
bHLH-PAS/JHRs from various insect species.

Worldwide insect damage to food and fiber costs
billions of dollars annually. Although chemical
insecticides are still the primary means of insect
control, the use of chemicals has several drawbacks,
including high cost of discovery, potential environmental
damage, and negative public opinion. One promising group
of insecticides consists of analogues of insect hormones,
such as juvenile hormone. Since vertebrates do not make
juvenile hormone (JH), insecticides targeted to the JH
system are highly toxic to certain insects, and have
shown an extraordinary degree of environmental safety.

Juvenile hormones comprise a family of hormones that
are secreted by the corpus allatum, and that play a role
in a variety of critical functions in insects, including
development, reproduction, and morphological
differentiation. Riddiford, "Hormone Action at the
molecules. A genomic polynucleotide comprising the Met
gene is the St-H fragment in Figure 1. This fragment is
6.234 Kb, and its sequence is shown in Figure 2 (SEQ ID
NO:1). Within this 6.234 Kb segment, there is a DNA
sequence of 3.011 Kb, which includes an open reading
frame that is divided by one intron of 69 nucleotides
(bases 1520 to 1588). This 3.011 Kb sequence is the
genomic Met-JHR DNA sequence (Figure 3, SEQ ID NO:2).
The Met-JHR open reading frame lacking the intron
codes for a protein of 716 amino acids having a
molecular weight of about 78,720 daltons.
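
As a rough plausibility check on the two figures just given, the stated mass can be compared with the common rule of thumb of about 110 daltons per amino acid residue. The 110 Da average is an assumption introduced here for illustration and is not a value taken from this specification; the sketch does not replace a mass computed from the actual residue composition.

```python
# Rule-of-thumb check only. AVG_RESIDUE_DA is an assumed average residue mass,
# not a value from the specification.
AVG_RESIDUE_DA = 110.0

residues = 716         # protein length stated in the text
stated_mass = 78_720   # daltons, stated in the text

estimate = residues * AVG_RESIDUE_DA
relative_gap = abs(estimate - stated_mass) / stated_mass

print(f"estimate: {estimate:.0f} Da, relative gap: {relative_gap:.2%}")
```

Under that assumption the estimate and the stated mass differ by well under one percent, so the stated length and mass are mutually consistent.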

The nucleotide sequences of the genomic and cDNA Met-JHR
differ, reflecting polymorphism. In Figure 3, SEQ ID
NO:3 represents a Met-JHR cDNA sequence, which begins at
nucleotide 4 of the genomic sequence. There is one
"polymorphic" difference between the genomic and cDNA
nucleotide sequences that results in a change at the
amino acid level. The nucleotide at position 1043
(genomic) / 1039 (cDNA) may be C or T, which results in
different deduced amino acids, R and W, respectively.

In the sequence of the genomic DNA, there is one
ambiguity that results in different deduced amino acids.
Base number 875 in the genomic DNA is designated "R,"
which signifies that the nucleotide may be the purine A
or G. This results in two possible corresponding deduced
amino acids, G (Gly) or R (Arg), respectively.
In the sequence of the cDNA, there is one ambiguity that
results in different deduced amino acids. Base number
526 in the cDNA is designated "M," which signifies
that the nucleotide may be A or C. This
results in two possible corresponding deduced amino
acids, T (Thr) or P (Pro), respectively.
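
The effect of an ambiguity code on the deduced amino acid can be sketched as follows. This is a minimal illustration, assuming the IUPAC meanings R = A or G and M = A or C and the standard genetic code; the function names are hypothetical, and the codon `MCC` is an illustrative input chosen to match the Thr/Pro outcome described above.

```python
from itertools import product

# Subset of IUPAC nucleotide codes; R and M are the two codes used in the text.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "M": "AC"}

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def expand_codon(codon):
    """Return every unambiguous codon that an IUPAC-coded codon can represent."""
    choices = [IUPAC[base] for base in codon.upper()]
    return ["".join(p) for p in product(*choices)]

def possible_residues(codon):
    """Translate each expansion and return the sorted one-letter residues."""
    return sorted({CODON_TABLE[c] for c in expand_codon(codon)})

print(expand_codon("MCC"))       # the two codons hidden behind the M code
print(possible_residues("MCC"))  # the residues they encode
```

Here `MCC` expands to `ACC` and `CCC`, which encode threonine and proline.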

As used herein, the term "juvenile hormone 
receptor" (JHR) is used to mean a polypeptide that is 
3 5 involved in binding JHIII. As used herein, a polypeptide 
that is "involved in binding" JHIII includes a 
transcript with the genomic nucleotide sequence. The
probable transcription start site for this transcript
is 220 bp upstream from the start codon, and the
probable transcript ends 912 bp from the stop codon.

A comparison of the cDNA to the genomic sequence
showed that the genomic ORF is 2.22 Kb and the cDNA ORF
is 2.151 Kb. The difference between the two sequences is
a 69 nucleotide intron, which corresponds to 23 codons
and does not change the open reading frame of the genomic
and cDNA. The presence of the intron provides evidence
for the possibility of alternatively spliced variants of
Met-JHR and hence multiple isoform proteins of Met-JHR.
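
The arithmetic behind the frame-preservation statement can be checked with a short sketch. The lengths are those stated above; treating the 2.151 Kb figure as including the stop codon is an assumption made here.

```python
# Lengths as stated in the text, in nucleotides.
genomic_orf = 2220   # "2.22 Kb"
cdna_orf = 2151      # "2.151 Kb"
intron = 69

# Removing the intron should account for the whole length difference.
assert genomic_orf - cdna_orf == intron

# An intron whose length is a multiple of 3 leaves the downstream frame unchanged.
codons_in_intron, remainder = divmod(intron, 3)
frame_preserved = remainder == 0

# Codons in the spliced ORF (assumed here to include the stop codon).
spliced_codons = cdna_orf // 3

print(codons_in_intron, frame_preserved, spliced_codons)
```

Under that assumption the spliced ORF holds 717 codons, which agrees with a 716-residue protein followed by one stop codon.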

The cDNA in Figure 3 (SEQ ID NO:3) comprises a single long
open reading frame (ORF). The DNA sequence (CAAAATGGCA) (SEQ ID
NO:13) surrounding the ATG of this ORF is in good
agreement with a Drosophila translation start site
consensus sequence. Cavener, Nucl. Acids Res. 15:1353
(1987).
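
As an illustration, the start-site context can be located in the genomic sequence with a plain substring search. The fragment below is bases 181-240 of SEQ ID NO:2 as printed in the sequence listing; the variable names are hypothetical.

```python
# Bases 181-240 of the genomic Met-JHR sequence (SEQ ID NO:2), from the listing.
FRAGMENT_START = 181  # 1-based coordinate of the first base in the fragment
fragment = (
    "aacaaggagg" "cagtaactcc" "agaaaacgcc"
    "caaaaagtcc" "aaaatggcag" "caccagagac"
).upper()

START_CONTEXT = "CAAAATGGCA"  # SEQ ID NO:13 as quoted in the text

offset = fragment.find(START_CONTEXT)  # 0-based index, -1 if absent
if offset < 0:
    raise ValueError("start-site context not found in fragment")

# Position of the ATG inside the context, then its 1-based genomic coordinate.
atg_offset = offset + START_CONTEXT.index("ATG")
atg_position = FRAGMENT_START + atg_offset

print(offset, atg_position)
```

The ATG found this way lies at genomic position 224, the position given for the start of the first exon.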

The first genomic exon is from position 224 to
position 1519 (1296 bases). This is followed by a 69 bp
intron, and a second exon, which extends from position
1589 to 2443 (855 bases). The remainder of the Met-JHR
gene is from 2443 to 3011 (568 bases). The total length
of the nucleotide sequence provided for the genomic DNA
is 3011 nucleotides (SEQ ID NO:2), and that of the cDNA
is 3282 nucleotides (SEQ ID NO:3).
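
The exon and intron lengths follow from the coordinates by closed-interval arithmetic, sketched below. The first exon is taken to end at position 1519, the base immediately before the intron at bases 1520 to 1588; that end point is an assumption inferred from the stated intron bounds and the stated 1296-base length.

```python
def span_length(start, end):
    """Length of a closed, 1-based interval [start, end]."""
    return end - start + 1

# Coordinates on the genomic sequence (SEQ ID NO:2). The exon 1 end point
# (1519) is an inferred assumption, as explained in the lead-in.
exon1 = (224, 1519)
intron = (1520, 1588)
exon2 = (1589, 2443)

lengths = {
    name: span_length(*interval)
    for name, interval in [("exon1", exon1), ("intron", intron), ("exon2", exon2)]
}
coding_total = lengths["exon1"] + lengths["exon2"]

print(lengths, coding_total)
```

The two exons together give 2151 bases, the same figure stated for the cDNA ORF.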

Comparison of the Met-JHR ORF with sequences
deposited in the GenBank database showed three regions of
homology to members of a family of transcriptional
activators known as the basic helix-loop-helix-Per-Arnt-Sim
(bHLH-PAS) proteins. See Figure 6. Three vertebrate
members of this family include the aromatic hydrocarbon
receptor nuclear translocator (ARNT), muscle and brain
ARNT-like protein 1 (BMAL-1), and the aromatic
hydrocarbon receptor (AHR). Three Drosophila family
members include ARNT (DARNT), Trachealess (Trh) and
Single-minded (Sim).

The ARNT and AHR proteins are involved as
heterodimeric partners in binding a variety of
environmental toxicants, including dioxin, and
subsequently activating a variety of genes important in
the degradation of these chemicals, such as the
cytochrome P450 genes. Figure 6 indicates that Met-JHR
is neither DARNT nor AHR. However, Met-JHR shares
considerable homology to human AHR in the ligand binding
region of AHR, which is amino acids 200-400 of AHR.
Rowlands et al. Crit. Rev. Toxicol. 27:109 (1997).
Another feature apparent from visual inspection of the
Met-JHR sequence is that Met-JHR, like human AHR (HAHR),
has a high concentration of serine and threonine residues
at its carboxyl terminus. This is the motif of a S/P/T
transactivation domain, as noted above. In AHR, this
domain has been shown to be a functional TAD.

These features support the hypothesis that the
mechanism of action of Met-JHR is similar to that of AHR, i.e.,
Met-JHR binds the JH ligand. In addition, Met-JHR
may heterodimerize with DARNT or a DARNT-like protein in
order to bind a JH response element and mediate JH
action. The bHLH domain has been shown to be involved in
dimerization and DNA binding. Rowlands et al. Crit. Rev.
Toxicol. 27:109 (1997).

Met-JHR also contains the "LXXLL" (SEQ ID NO:14)
motif, which likens Met to steroid receptor co-activators.
Although this motif is found in many proteins, it plays
a significant role in proteins that interact with
co-activators of steroid receptors. LXXLL (SEQ ID NO:14)
also has been found in a bHLH-PAS protein that is a
cofactor (ACTR) [Chen et al. Cell 90:569 (1997)] that is
amplified in breast cancer-1 (AIBC). Anzick et al.
Science 277:965 (1997). This bHLH-PAS protein
(ACTR/AIBC) interacts with a steroid
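
The LXXLL motif discussed in this passage can be located with a simple pattern search. The sketch below scans residues 113-144 of SEQ ID NO:4, converted by hand to one-letter codes; whether the occurrence it reports is the one the specification has in mind is not stated in the text, so this is an illustration only.

```python
import re

# Residues 113-144 of SEQ ID NO:4, hand-converted to one-letter codes.
FRAGMENT_START = 113
fragment = "SPVGDLPNPSLHLTDTLMQLLDCCFLTLTCSG"

# Lookahead pattern so that overlapping occurrences would also be reported.
LXXLL = re.compile(r"(?=(L..LL))")

# Each hit is (1-based residue number of the first Leu, matched motif).
hits = [(FRAGMENT_START + m.start(), m.group(1)) for m in LXXLL.finditer(fragment)]

print(hits)
```

In this fragment the search reports a single occurrence, LMQLL, beginning at residue 129.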
and the DNA binding region of a second polypeptide, (2)
DNA encoding a bHLH-PAS polypeptide and the activation 
domain of said second polypeptide, and (3) a reporter 
gene under the control of a minimal promoter driven by 
the response element for said second polypeptide; 

(b) incubating a test compound with said host cell;
and

(c) detecting the binding of the test compound with 
said bHLH-PAS polypeptide by monitoring expression of the 
reporter gene. 

37. A method according to claim 32, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

38. A method according to claim 33, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

39. A method according to claim 34, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

40. A method according to claim 35, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

41. A method according to claim 36, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

42. An isolated polynucleotide which comprises the
sequence of SEQ ID NO: 6.

43. An isolated polynucleotide which comprises the 
sequence of SEQ ID NO: 7. 
<110> Wilson, Thomas G.
Heinrich, Julia N.
American Cyanamid Company

<120> RECOMBINANT BHLH-PAS/JHR POLYPEPTIDE AND ITS USE TO
SCREEN POTENTIAL INSECTICIDES

<130> 082584/104

<140> PCT/US98/07388
<141> 1998-04-14

<150> 08/971,188
<151> 1997-11-17

<150> 08/843,205
<151> 1997-04-14

<160> 14

<170> PatentIn Ver. 2.0

<210> 1 

<211> 6234 

<212> DNA 

<213> St-H segment 



<400> 1 

aggccttgac attgttaatc aggtggctca atccattata gccatcctta tcgaagtcga 60 
tgttctcgcc gccaaactcg atgacctgca aatgaatagt gggcaaatgg tttgggatca 120 
ttgaaataaa tgactccatg gattgagtgc acccacctcc ttggccctga aggtgtccgc 180 
accatggggc actccaatga actcgaacca ggccccttcg ggggggccca gctcaccggc 240 
agattgatct ttagcttgtg gttgtctgtg gaactggcca gcaaagtgga tcccgcgacg 300 
ctctccacgc gaaccattat gctcacggtt tggccggaaa actgcttcag catgccgccg 360 
ttgataatcg agcgtggatc aaaggcatcc ataataactc ctcagttaat aacgttcaaa 420 
aaacagccaa tttcagcaga aaatgaaatg ccgtaaacaa gaaagagcgc caatagttat 480 
cgagcgtgct aatcgataat aaaattgcat gacgtctgct atcgaaatta ttcgaatttt 540 
gtagcaaaaa tatttatggg atttattgta acatcaaaag ttattacatt attactgtgg 600 
acttatgtat tataataaga attaataaaa ataatttata atagaattat aaaaagacct 660 
gatagattac atgtattaca acaatgttat cgacagcgaa caagatgttc gaatctttct 720 
gatattttgt ccggtgtgaa cgtgcgaatt ttagtacgac aaccagttac atcgatgtat 780 
cgatagtttt tagactggcc aatgtttatt gttataaaat aaatgttaaa aataaacaat 840 
taatcactta gtataaacaa aaataagtgc aagcagcaag tgcgttgtta aaacgaccgc 900 
cgccctccga atttccttat tgcgggcatt aaatcgcaac aacaacaacg ccaccagcaa 960 
cagcaacaac aataagcagg agattcagat tcgcaacgca atcaaaacgt aaaaaaaaaa 1020 
cgaaaaataa aaaactacaa ggcgaaacgc ataaataaat agggaagcaa ataatagtaa 1080 
taataagact aaaagcaagt ggaaaagtac gaaagcgaaa agagaaaaca tatgtacgtt 1140 
gcgtgagagt gacgtgtgtg tgtgtgtgtc agtgagcgaa ggagagtgag cgagagaggg 1200 
gaggcgagca aaaacaaaaa caacagcaca gcacacagca taagcgagac cttgaaaact 1260 
ttaaaagcag caaaaaaaac aagacaacag ccaaaaataa gaaactaaaa gctgcaaaag 1320 
taataaaaaa tatattttag ccgaaaaatt tccataataa caattctaga agtgcggagc 1380 
gtacaccctg ttatggagag tgacgatttt catttaccgc aaggcgccaa ttaaagggga 1440 
aaatccataa atcgaggatt acaagtggaa aacaaggagg cagtaactcc agaaaacgcc 1500 
caaaaagtcc aaaatggcag caccagagac gggcaacacg ggctccacag gatccgctgg 1560 
ctcgacagga tcgggatcgg gatcgggatc gggaagtggg agctcctcag atccagcgaa 1620 
tggacgggag gcccgtaacc ttgccgaaaa acagcgacgg gataagctta atgccagcat 1680 
ccaggagctg gccaccatgg taccacatgc agccgaatcc tcccgtcgcc tggacaaaac 1740 
cgccgtcctt agattcgcca cccatggcct gagacttcag tatgtctttg gcaagtccgc 1800 
ttccagacgt cgcaagaaaa ccggcctcaa gggaacgggt atgtctgcct cacctgtcgg 1860
agatctaccc aatcccagtc tgcatctaac ggacactcta atgcaactgc tggactgctg 1920
cttcctcacc ctaacctgca gtggccaaat cgttttggta tccaccagcg tggagcagct 1980 
attgggtcac tgtcagtccg atttgtatgg ccagaatcta ctgcagatca cgcatcccga 2040 
tgatcaggat ctgttaagac agcagctaat acccagggat atagagaccc tgttctatca 2100 
gcatcagcac caccagcagc aggggcacaa tccccagcag cactccactt ccacgtcggc 2160 
ctcagcttcg ggcagtgatc tggaggagga ggaaatggag acggaggaac accgtctggg 2220 
tcggcagcag ggagaggcgg acgatgacga ggatcacccg tacaacggac gaacacccag 2280 
cccgcggaga atggcccatt tggcgaccat tgatgaccga ctacgcatgg atcggcgctg 2340 
ctttaccgtc cgcttggcta gggcttccac gcgagcggag gccacgcgtc attacgagcg 2400 
ggttaagatc gatggctgct ttcgtcgcag tgactcctcc ttaaccggag gtgccgctgc 2460
caactatccg attgtctccc agctgatacg acgctcgaga aacaacaata tgctggctgc 2520 
cgctgcagca gtggcagcag aagcggcgac ggtgccaccc cagcacgatg ccattgccca 2580 
ggcggcgctg cacgggatta gcggcaatga tattgtcctg gtggccatgg ccagggtgct 2640 
gcgagaggaa cggccgcctg aggagacgga gggtacagtg ggcttgacca tttacagaca 2700 
gccagaaccc tatcagctgg agtaccatac gaggcatcta atcgacggca gcatcatcga 2760 
ctgtgatcaa aggattggtc tggtggcggg atatatgaag gatgaggtgg gtatattaac 2820 
atcatctctc tgaactgctt acgacaacta atcgtgtact ctccactcga aacaggtgcg 2880 
caaccttagt cccttctgtt tcatgcacct ggacgacgtt cgctgggtga ttgtggccct 2940 
tcgacaaatg tacgattgca acagtgacta tggcgagagc tgctaccgtc tgctgtcccg 3000 
caacgggcgc ttcatttacc tgcacaccaa gggatttctg gaggtcgacc gtggcagtaa 3060 
taaggtgcat tcctttctgt gcgtcaacac gctgctcgat gaggaggcgg gccggcaaaa 3120 
ggtgcaggag atgaaggaga aattctcgac aatcatcaag gcggagatgc ccacgcagag 3180 
cagcagtccc gatttgcccg cctcgcaggc accgcagcaa cttgagagaa ttgtcctcta 3240
tctaatagag aacctacaga agagtgtgga ttcagcagag acggttggcg gccagggcat 3300 
ggaaagccta atggacgatg gctacagttc gccagcaaat accttaactc tcgaggagtt 3360 
agctccctcg cccacgcccg ccttggcctt ggtgccgccg gctccctcat cggtcaagag 3420 
ctccatctcc aagtcggtga gtgtggtcaa tgtgacggcg gccagaaagt ttcagcagga 3480 
gcatcagaag cagcgtgaac gtgaccgtga gcagcttaag gagcgcacca actccacgca 3540
gggcgtgatc cggcaactga gcagctgcct aagcgaggcg gaaacggcat cctgtatcct 3600 
atcaccagcc agtagcttga gtgccagcga agcaccggac acgcccgatc cgcacagcaa 3660 
cacatcaccg ccaccgtcgc tccacacacg tcccagtgtc ctgcatcgaa ccctgaccag 3720 
cacgctgcga tgacgggctg atggaacctg gtttgccttc taattgggtg tgtggaaatg 3780 
gacgtaattg gtagctcacg tgcccacaaa cgaattagta tcggtaatat aatcctggcc 3840 
aatcgcaatg tgaaaaccca aaatgtatca gaaaaaaaac gagcattatt caaatagttt 3900 
aaaaattcag ccaaaaaact taaatacgaa aaaaaagagc gtgggttgaa gaaccttttg 3960 
ttttcatatt cacatttcca agctttgagc aatcaaacaa ttttaatttt cagtatacac 4020
atatgtataa tgagttggct ttacaaaagc tattaacaaa tcaagcaatt gtgtaattta 4080
atatgagact ttccgtgatt tttgctttct acgtactttt cgacttcaat tgatctatag 4140 
ggtttccgta ttaaaaacga aattaacgtg gtttcatttg atgaaaatgc aatatgagct 4200 
cgcatttatt ttgatattat gacagtaata atgatctgat cacgataatc gttttctcaa 4260 
aacataagcg atacattttg ggtacatttg gccattactg tttctgtgtg tgatttcggt 4320
ataaaatagt agtttgatta catgttatat tgatgaatgg cgatcggtgg gtgctgctaa 4380 
atgcgttcca ttatcaataa ttttcgttat gtaattacgt ttaatttgta aatatgtatg 4440 
agtgcgagcg tgagtgagtt tgtgatcgtg tcagcatggg tgtgaatgaa cattagatca 4500 
gtgctcggat ttggttttag ttgaaattta aaccccattt ccccgatttc ccagttatca 4560 
ccttccgccc caaaacacca ttgtaaaaag agtacaaaaa aaaaaaraaa agaaaataga 4620
aaaacaaaca aacaattata tatttatttc gccctaagtc tagaacgtgc taaacacaac 4680 
tcattaatag ttaaacaaac ggatgttgca atcgatggaa attaaacgct cgcttttagt 4740 
tttgccgtct cgctcgaaga aagaaagagg actactacat atgtacagtc aaactaatcc 4800 
aagtcaaact cttcagtctc agaattggag actttattaa aggtttttta ttttatgaag 4860 
aatagcatat tatttttata tatatatatt tatgtatata tatttatatc acaaatctcc 4920
ttcgatatcc ctttgtaatc tataaaaaca cttccaggcg cagctttatt tttcaacagt 4980 
tttaattttg ggaactttac aaagcatata agcatacccc aaatccatat ccttataaac 5040 
tttgtagtta ttacaaatgt gttcaaaatt aaaaaaaaaa aagtttgtca tccaaaatcc 5100 
aatccgctaa atacaatata atcattttaa ttgatctatg ttacacttct ataatgtcta 5160 
ctggacaaaa tatgattaag ttgaaatttc aaagtgtttt tgaagtttgt actaacgaag 5220 
tttggctgta gttcaatact cacagaaagg gagcgagaca tattcccatc tcgctccgac 5280 
ctgttatcaa agcctttgcc attgtccttg gcgttgcgtt cttatcaccg gcgtatcgga 5340 
gggaatatat gatttttttt tcggccaagt ggttctagtg gtaaatatga agcggtgggt 5400 
ggtttccggg ggtttggggg aaccgcggtt ggtgatgggc gcagagaagg cgggaagcgg 5460
gggggaacac gtttcgttat tttttgtttt atagcgaaat tcctctagct ataaatgtat 5520 
gcatacatat atgctattaa cttttcgttt agctcaattt tttgtttgta tttttaatta 5580
taactcggtt tgtctagtgt gtaattaaat taaaagtcca atgctaattc cccagcgaat 5640
cgttcagtgt ttgttcaaac gagttgtgct ctttaaacac ttttggctta tcctttcatt 5700 
taatttttgt ttgttttcct tgcgttgtta tatgtccaaa taaaaatgaa agtttgaatt 5760 
gttttttttt agtaatttag ttttaactgt aatgtcaaat gtcgagaggt atactctgta 5820 
taagaataat aatgtgtaat gtaattttta gaacttgcta atctctagca attaaaacat 5880 
agcctgtaat taacaataat taattataat aaccagtaac agttcatgca aatgaaacct 5940 
taataaataa taaattatat aaaagaagat ggaagtgttt tcaatcaatg aagttacttc 6000 
ctattttcag gagttggaat gttacacttt tttataagct gatttgaaat gcgcgccaca 6060 
ttatatctgc cttgcgcgca cgcattgcag ccgaataacg gcgaattaac tcccctcaga 6120 
ggtgagtttg aactatcgat tataatcgat tgcattcgga aacaaaaaac accttgagtt 6180 
tggtcagatc ggttttgcgc gggataaata ataaaatatt aaatgttttc tttg 6234 

<210> 2 
<211> 3011 
<212> DNA 

<213> Met-JHR genomic DNA 



<400> 2 

ccaaaaataa gaaactaaaa gctgcaaaag taataaaaaa tatattttag ccgaaaaatt 60 
tccataataa caattctaga agtgcggagc gtacaccctg ttatggagag tgacgatttt 120 
catttaccgc aaggcgccaa ttaaagggga aaatccataa atcgaggatt acaagtggaa 180 
aacaaggagg cagtaactcc agaaaacgcc caaaaagtcc aaaatggcag caccagagac 240 
gggcaacacg ggctccacag gatccgctgg ctcgacagga tcgggatcgg gatcgggatc 300 
gggaagtggg agctcctcag atccagcgaa tggacgggag gcccgtaacc ttgccgaaaa 360 
acagcgacgg gataagctta atgccagcat ccaggagctg gccaccatgg taccacatgc 420 
agccgaatcc tcccgtcgcc tggacaaaac cgccgtcctt agattcgcca cccatggcct 480 
gagacttcag tatgtctttg gcaagtccgc ttccagacgt cgcaagaaaa ccggcctcaa 540 
gggaacgggt atgtctgcct cacctgtcgg agatctaccc aatcccagtc tgcatctaac 600 
ggacactcta atgcaactgc tggactgctg cttcctcacc ctaacctgca gtggccaaat 660 
cgttttggta tccaccagcg tggagcagct attgggtcac tgtcagtccg atttgtatgg 720 
ccagaatcta ctgcagatca cgcatcccga tgatcaggat ctgttaagac agcagctaat 780 
acccagggat atagagaccc tgttctatca gcatcagcac caccagcagc aggggcacaa 840 
tccccagcag cactccactt ccacgtcggc ctcarcttcg ggcagtgatc tggaggagga 900 
ggaaatggag acggaggaac accgtctggg tcggcagcag ggagaggcgg acgatgacga 960 
ggatcacccg tacaaccgac gaacacccag cccgcggaga atggcccatt tggcgaccat 1020 
tgatgaccga ctacgcatgg atcggcgctg ctttaccgtc cgcttggcta gggcttccac 1080 
gcgagcggag gccacgcgtc attacgagcg ggttaagatc gatggctgct ttcgtcgcag 1140 
tgactcctcc ttaaccggag gtgccgctgc caactatccg attgtctccc agctgatacg 1200 
acgctcgaga aacaacaata tgctggctgc tgctgcagca gtggcagcag aagcggcgac 1260 
ggtgccgccc cagcacgatg ccattgccca ggcggcgctg cacgggatta gcggcaatga 1320 
tattgtcctg gtggccatgg ccagggtgct gcgagaggaa cggccgcctg aggagacgga 1380 
gggtacagtg ggcttgacca tttacagaca gccagaaccc tatcagttgg agtaccatac 1440 
gaggcatcta atcgacggca gcatcatcga ctgtgatcaa aggattggtc tggtggcggg 1500 
atatatgaag gatgaggtgg gtatattaac atcatctctc tgaactgctt acgacaacta 1560 
atcgtgtact ctccactcga aacaggtgcg caaccttagt cccttctgtt tcatgcacct 1620 
ggacgacgtt cgctgggtga ttgtggccct tcgacaaatg tacgattgca acagtgacta 1680 
cggcgagagc tgctaccgtc tgctgtcccg caacgggcgc ttcatttacc tgcacaccaa 1740 
gggatttctg gaggtcgacc gtggcagtaa taaggtgcat tcctttctgt gcgtcaacac 1800 
gctgctcgat gaggaggcgg gccggcaaaa ggtgcaggag atgaaggaga aattctcgac 1860 
aatcatcaag gcggagatgc ccacgcagag cagcagtccc gatttgcccg cctcgcaggc 1920 
accgcagcaa cttgagagaa ttgtcctcta tctaatagag aacctacaga agagtgtgga 1980 
ttcagcagag acggttggcg gccagggcat ggaaagccta atggacgatg gctacagttc 2040
gccagcaaat accttaactc tcgaggagtt agctccctcg cccacgcccg ccttggcctt 2100 
ggtgccgccg gctccctcat cggtcaagag ctccatctcc aagtcggtga gtgtggtcaa 2160 
tgtgacggcg gccagaaagt ttcagcagga gcatcagaag cagcgtgaac gtgaccgtga 2220 
gcagcttaag gagcgcacca actccacgca gggcgtgatc cggcaactga gcagctgcct 2280 
aagcgaggcg gaaacggcat cctgtatcct atcaccagcc agtagcttga gtgccagcga 2340
agcaccggac acgcccgatc cgcacagcaa cacatcaccg ccaccgtcgc tccacacacg 2400
tcccagtgtc ctgcatcgaa ccctgaccag cacgctgcga tgacgggctg atggaacctg 2460
gtttgccttc taattgggtg tgtggaaatg gacgtaattg gtagctcacg tgcccacaaa 2520 
cgaattagta tcggtaatat aatcctggcc aatcgcaatg tgaaaaccca aaatgtatca 2580
gaaaaaaaac gagcattatt caaatagttt aaaaattcag ccaaaaaact taaaaacgaa 2640 
aaaaaagagc gtgggttgaa gaaccttttg ttttcatatt cacatttcca agctttgagc 2700 
aatcaaacaa ttttaatttt cagtatacac atatgtataa tgagttggct ttacaaaagc 2760 
tattaacaaa tcaagcaatt gtgtaattta atatgagact ttccgtgatt tttgctttct 2820 
acgtactttt cgacttcaat tgatctatag ggtttccgta ttaaaaacga aattaacgtg 2880 
gtttcatttg atgaaaatgc aatatgagct cgcatttatt ttgatattat gacagtaata 2940 
atgatctgat cacgataatc gttttctcaa aacataagcg atacattttg ggtacatttg 3000 
gccattactg t 3011 

<210> 3 
<211> 3282 
<212> DNA 

<213> Met-JHR cDNA 



<400> 3 

aaataagaaa ctaaaagctg caaaagtaat aaaaaatata ttttagccga aaaatttcca 60 
taataacaat tctagaagtg cggagcgtac accctgttat ggagagtgac gattttcatt 120 
taccgcaagg cgccaattaa aggggaaaat ccataaatcg aggattacaa gtggaaaaca 180 
aggaggcagt aactccagaa aacgcccaaa aagtccaaaa tggcagcacc agagacgggc 240 
aacacgggct ccacaggatc cgctggctcg acaggatcgg gatcgggatc gggatcggga 300 
agtgggagct cctcagatcc agcgaatgga cgggaggccc gtaaccttgc cgaaaaacag 360 
cgacgggata agcttaatgc cagcatccag gagctggcta ccatggtacc acatgcagcc 420 
gaatcctccc gtcgcctgga caaaaccgcc gtcctcagat ttgccaccca tggcctgaga 480 
cttcagtatg tctttggcaa gtccgcttcc agacgtcgca agaaamccgg cctcaaggga 540
acgggtatgt ctgcctcacc tgtcggagat ctacccaatc ccagtctgca tctaacggac 600 
actctaatgc aactgctgga ctgctgcttc ctcaccctaa cctgcagtgg ccaaatcgtt 660 
ttggtatcca ccagcgtgga gcagctattg ggtcactgtc agtccgattt gtatggccag 720 
aatctactgc agatcacgca tcccgatgat caggatctgt taagacagca gctaataccc 780 
agggatatag agaccctgtt ctatcagcat cagcaccacc agcagcaggg gcacaatccc 840
cagcagcact ccacttccac gtcggcctca gcttcgggca gtgatctgga ggaggaggaa 900 
atggagacgg aggaacaccg tctgggtcgg cagcagggag aggcggacga tgacgaggat 960 
cacccgtaca accgacgaac acccagcccg cggagaatgg cccatttggc gaccattgat 1020 
gaccgactac gcatggattg gcgctgcttt accgtccgct tggctagggc ttccacgcga 1080 
gcggaggcca cgcgtcatta cgagcgggtt aagatcgatg gctgctttcg tcgcagtgac 1140 
tcctccttaa ccggaggtgc cgctgccaac tatccgattg tctcccagct gatacgacgc 1200 
tcgagaaaca acaatatgct ggctgccgct gcagcagtgg cagcagaagc ggcgacggtg 1260 
ccaccccagc acgatgccat tgcccaggcg gcgctgcacg ggattagcgg caatgatatt 1320 
gtcctggtgg ccatggccag ggtgctgcga gaggaacggc cgcctgagga gacggagggt 1380 
acagtgggct tgaccattta cagacagcca gaaccctatc agctggagta ccatacgagg 1440 
catctaatcg acggcagcat catcgactgt gatcaaagga ttggtctggt ggcgggatat 1500 
atgaaggatg aggtgcgcaa ccttagtccc ttctgtttca tgcacctgga cgacgttcgc 1560 
tgggtgattg tggcccttcg acaaatgtac gattgcaaca gtgactatgg cgagagctgc 1620 
taccgtctgc tgtcccgcaa cgggcgcttc atttacctgc acaccaaggg atttctggag 1680 
gtcgaccgtg gcagtaataa ggtgcattcc tttctgtgcg tcaacacgct gctcgatgag 1740 
gaggcgggcc ggcaaaaggt gcaggagatg aaggagaaat tctcgacaat catcaaggcg 1800 
gagatgccca cgcagagcag cagtcccgat ttgcccgcct cgcaggcacc gcagcaactt 1860
gagagaattg tcctctatct aatagagaac ctacagaaga gtgtggattc agcagagacg 1920 
gttggcggcc agggcatgga aagcctaatg gacgatggct acagttcgcc agcaaatacc 1980 
ttaactctcg aggagttagc tccctcgccc acgcccgcct tggccttggt gccgccggct 2040 
ccctcatcgg tcaagagctc catctccaag tcggtgagtg tggtcaatgt gacggcggcc 2100 
agaaagtttc agcaggagca tcagaagcag cgtgaacgtg accgtgagca gcttaaggag 2160 
cgcaccaact ccacgcaggg cgtgatccgg caactgagca gctgcctaag cgaggcggaa 2220 
acggcatcct gtatcctatc accagccagt agcttgagtg ccagcgaagc accggacacg 2280 
cccgatccgc acagcaacac atcaccgcca ccgtcgctcc acacacgtcc cagtgtcctg 2340 
catcgaaccc tgaccagcac gctgcgatga cgggctgatg gaacctggtt tgccttctaa 2400 
ttgggtgtgt ggaaatggac gtaattggta gctcacgtgc ccacaaacga attagtatcg 2460
gtaatataat cctggccaat cgcaatgtga aaacccaaaa tgtatcagaa aaaaaacgag 2520 
cattattcaa atagtttaaa aattcagcca aaaaacttaa aaacgaaaaa aaagagcgtg 2580 
ggttgaagaa ccttttgttt tcatattcac atttccaagc tttgagcaat caaacaattt 2640 
taattttcag tatacacata tgtataatga gttggcttta caaaagctat taacaaatca 2700 
agcaattgtg taatttaata tgagactttc cgtgattttt gctttctacg tacttttcga 2760
cttcaattga tctatagggt ttccgtatta aaaacgaaat taacgtggtt tcatttgatg 2820
aaaatgcaat atgagctcgc atttattttg atattatgac agtaataatg atctgatcac 2880
gataatcgtt ttctcaaaac ataagcgata cattttgggt acatttggcc attactgttt 2940
ctgtgtgtga tttcggtata aaatagtagt ttgattacat gttatattga tgaatggcga 3000
tcggtgggtg ctgctaaatg cgttccatta tcaataattt tcgttatgta attacgttta 3060
atttgtaaat atgtatgagt gcgagcgtga gtgagtttgt gatcgtgtca gcatgggtgt 3120
gaatgaacat tagatcagtg ctcggatttg gttttagttg aaatttaaac cccatttccc 3180
cgatttccca gttatcacct tccgccccaa aacaccattg taaaaagagt acaaaaaaaa 3240
aaagaaaaga aaatagaaaa acaaaaaaaa aaaaaaaaaa aa 3282

<210> 4 
<211> 737 
<212> PRT 
<213> Met-JHR 

<220> 

<223> Xaa at position 218 may be Thr or Pro 
<400> 4 

Met Ala Ala Pro Glu Thr Gly Asn Thr Gly Ser Thr Gly Ser Ala Gly
1 5 10 15

Ser Thr Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Ser Ser
20 25 30

Asp Pro Ala Asn Gly Arg Glu Ala Arg Asn Leu Ala Glu Lys Gln Arg
35 40 45

Arg Asp Lys Leu Asn Ala Ser Ile Gln Glu Leu Ala Thr Met Val Pro
50 55 60

His Ala Ala Glu Ser Ser Arg Arg Leu Asp Lys Thr Ala Val Leu Arg
65 70 75 80

Phe Ala Thr His Gly Leu Arg Leu Gln Tyr Val Phe Gly Lys Ser Ala
85 90 95

Ser Arg Arg Arg Lys Lys Thr Gly Leu Lys Gly Thr Gly Met Ser Ala
100 105 110

Ser Pro Val Gly Asp Leu Pro Asn Pro Ser Leu His Leu Thr Asp Thr
115 120 125

Leu Met Gln Leu Leu Asp Cys Cys Phe Leu Thr Leu Thr Cys Ser Gly
130 135 140

Gln Ile Val Leu Val Ser Thr Ser Val Glu Gln Leu Leu Gly His Cys
145 150 155 160

Gln Ser Asp Leu Tyr Gly Gln Asn Leu Leu Gln Ile Thr His Pro Asp
165 170 175

Asp Gln Asp Leu Leu Arg Gln Gln Leu Ile Pro Arg Asp Ile Glu Thr
180 185 190

Leu Phe Tyr Gln His Gln His His Gln Gln Gln Gly His Asn Pro Gln
195 200 205

Gln His Ser Thr Ser Thr Ser Ala Ser Xaa Ser Gly Ser Asp Leu Glu
210 215 220

Glu Glu Glu Met Glu Thr Glu Glu His Arg Leu Gly Arg Gln Gln Gly
225 230 235 240

Glu Ala Asp Asp Asp Glu Asp His Pro Tyr Asn Arg Arg Thr Pro Ser
245 250 255

Pro Arg Arg Met Ala His Leu Ala Thr Ile Asp Asp Arg Leu Arg Met
260 265 270

Asp Arg Arg Cys Phe Thr Val Arg Leu Ala Arg Ala Ser Thr Arg Ala
275 280 285

Glu Ala Thr Arg His Tyr Glu Arg Val Lys Ile Asp Gly Cys Phe Arg
290 295 300

Arg Ser Asp Ser Ser Leu Thr Gly Gly Ala Ala Ala Asn Tyr Pro Ile
305 310 315 320

Val Ser Gln Leu Ile Arg Arg Ser Arg Asn Asn Asn Met Leu Ala Ala
325 330 335

Ala Ala Ala Val Ala Ala Glu Ala Ala Thr Val Pro Pro Gln His Asp
340 345 350

Ala Ile Ala Gln Ala Ala Leu His Gly Ile Ser Gly Asn Asp Ile Val
355 360 365

Leu Val Ala Met Ala Arg Val Leu Arg Glu Glu Arg Pro Pro Glu Glu
370 375 380

Thr Glu Gly Thr Val Gly Leu Thr Ile Tyr Arg Gln Pro Glu Pro Tyr
385 390 395 400

Gln Leu Glu Tyr His Thr Arg His Leu Ile Asp Gly Ser Ile Ile Asp
405 410 415

Cys Asp Gln Arg Ile Gly Leu Val Ala Gly Tyr Met Lys Asp Glu Val
420 425 430

Gly Ile Leu Thr Ser Ser Leu Thr Ala Tyr Asp Asn Ser Cys Thr Leu
435 440 445

His Ser Lys Gln Val Arg Asn Leu Ser Pro Phe Cys Phe Met His Leu
450 455 460

Asp Asp Val Arg Trp Val Ile Val Ala Leu Arg Gln Met Tyr Asp Cys
465 470 475 480

Asn Ser Asp Tyr Gly Glu Ser Cys Tyr Arg Leu Leu Ser Arg Asn Gly
485 490 495

Arg Phe Ile Tyr Leu His Thr Lys Gly Phe Leu Glu Val Asp Arg Gly
500 505 510

Ser Asn Lys Val His Ser Phe Leu Cys Val Asn Thr Leu Leu Asp Glu
515 520 525

Glu Ala Gly Arg Gln Lys Val Gln Glu Met Lys Glu Lys Phe Ser Thr
530 535 540

Ile Ile Lys Ala Glu Met Pro Thr Gln Ser Ser Ser Pro Asp Leu Pro
545 550 555 560

Ala Ser Gln Ala Pro Gln Gln Leu Glu Arg Ile Val Leu Tyr Leu Ile
565 570 575

Glu Asn Leu Gln Lys Ser Val Asp Ser Ala Glu Thr Val Gly Gly Gln
580 585 590

Gly Met Glu Ser Leu Met Asp Asp Gly Tyr Ser Ser Pro Ala Asn Thr
595 600 605

Leu Thr Leu Glu Glu Leu Ala Pro Ser Pro Thr Pro Ala Leu Ala Leu
610 615 620

Val Pro Pro Ala Pro Ser Ser Val Lys Ser Ser Ile Ser Lys Ser Val
625 630 635 640

Ser Val Val Asn Val Thr Ala Ala Arg Lys Phe Gln Gln Glu His Gln
645 650 655

Lys Gln Arg Glu Arg Asp Arg Glu Gln Leu Lys Glu Arg Thr Asn Ser
660 665 670

Thr Gln Gly Val Ile Arg Gln Leu Ser Ser Cys Leu Ser Glu Ala Glu
675 680 685

Thr Ala Ser Cys Ile Leu Ser Pro Ala Ser Ser Leu Ser Ala Ser Glu
690 695 700

Ala Pro Asp Thr Pro Asp Pro His Ser Asn Thr Ser Pro Pro Pro Ser
705 710 715 720

Leu His Thr Arg Pro Ser Val Leu His Arg Thr Leu Thr Ser Thr Leu
725 730 735

Arg



<210> 5 
<211> 716 
<212> PRT 
<213> Met-JHR 



<220> 

<223> Xaa at position 103 may be Gly or Arg 
<400> 5 

Met Ala Ala Pro Glu Thr Gly Asn Thr Gly Ser Thr Gly Ser Ala Gly 
15 10 15 

Ser Thr Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Ser Ser 

20 25 30 

Asp Pro Ala Asn Gly Arg Glu Ala Arg Asn Leu Ala Glu Lys Gin Arg 
35 40 45 

Arg Asp Lys Leu Asn Ala Ser He Gin Glu Leu Ala Thr Met Val Pro 
50 55 60 
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His Ala Ala Glu Ser Ser Arg Arg Leu Asp Lys Thr Ala Val Leu Arg 
65 70 75 80 

Phe Ala Thr His Gly Leu Arg Leu Gin Tyr Val Phe Gly Lys Ser Ala 

85 90 95 

Ser Arg Arg Arg Lys Lys Xaa Gly Leu Lys Gly Thr Gly Met Ser Ala 

100 105 HO 

Ser Pro Val Gly Asp Leu Pro Asn Pro Ser Leu His Leu Thr Asp Thr 
115 120 125 

Leu Met Gin Leu Leu Asp Cys Cys Phe Leu Thr Leu Thr Cys Ser GJ.y * 
130 135 140 

Gin lie Val Leu Val Ser Thr Ser Val Glu Gin Leu Leu Gly .His Cys 
1^5 150 155 160 

Gin Ser Asp Leu Tyr Gly Gin Asn Leu Leu Gin He Thr His Pro Asp 

165 170 175 

Asp Gin Asp Leu Leu Arg Gin Gin Leu He Pro Arg Asp He Glu Thr 

180 185 190 

Leu Phe Tyr Gin His Gin His His Gin Gin Gin Gly His Asn Pro Gin 
195 200 205 

Gln His Ser Thr Ser Thr Ser Ala Ser Ala Ser Gly Ser Asp Leu Glu
210 215 220

Glu Glu Glu Met Glu Thr Glu Glu His Arg Leu Gly Arg Gln Gln Gly
225 230 235 240

Glu Ala Asp Asp Asp Glu Asp His Pro Tyr Asn Arg Arg Thr Pro Ser
245 250 255

Pro Arg Arg Met Ala His Leu Ala Thr Ile Asp Asp Arg Leu Arg Met
260 265 270

Asp Trp Arg Cys Phe Thr Val Arg Leu Ala Arg Ala Ser Thr Arg Ala
275 280 285

Glu Ala Thr Arg His Tyr Glu Arg Val Lys Ile Asp Gly Cys Phe Arg
290 295 300

Arg Ser Asp Ser Ser Leu Thr Gly Gly Ala Ala Ala Asn Tyr Pro Ile
305 310 315 320

Val Ser Gln Leu Ile Arg Arg Ser Arg Asn Asn Asn Met Leu Ala Ala
325 330 335

Ala Ala Ala Val Ala Ala Glu Ala Ala Thr Val Pro Pro Gln His Asp
340 345 350

Ala Ile Ala Gln Ala Ala Leu His Gly Ile Ser Gly Asn Asp Ile Val
355 360 365

Leu Val Ala Met Ala Arg Val Leu Arg Glu Glu Arg Pro Pro Glu Glu
370 375 380

Thr Glu Gly Thr Val Gly Leu Thr Ile Tyr Arg Gln Pro Glu Pro Tyr
385 390 395 400

Gln Leu Glu Tyr His Thr Arg His Leu Ile Asp Gly Ser Ile Ile Asp
405 410 415

Cys Asp Gln Arg Ile Gly Leu Val Ala Gly Tyr Met Lys Asp Glu Val
420 425 430

Arg Asn Leu Ser Pro Phe Cys Phe Met His Leu Asp Asp Val Arg Trp
435 440 445

Val Ile Val Ala Leu Arg Gln Met Tyr Asp Cys Asn Ser Asp Tyr Gly
450 455 460

Glu Ser Cys Tyr Arg Leu Leu Ser Arg Asn Gly Arg Phe Ile Tyr Leu
465 470 475 480

His Thr Lys Gly Phe Leu Glu Val Asp Arg Gly Ser Asn Lys Val His
485 490 495

Ser Phe Leu Cys Val Asn Thr Leu Leu Asp Glu Glu Ala Gly Arg Gln
500 505 510

Lys Val Gln Glu Met Lys Glu Lys Phe Ser Thr Ile Ile Lys Ala Glu
515 520 525

Met Pro Thr Gln Ser Ser Ser Pro Asp Leu Pro Ala Ser Gln Ala Pro
530 535 540

Gln Gln Leu Glu Arg Ile Val Leu Tyr Leu Ile Glu Asn Leu Gln Lys
545 550 555 560

Ser Val Asp Ser Ala Glu Thr Val Gly Gly Gln Gly Met Glu Ser Leu
565 570 575

Met Asp Asp Gly Tyr Ser Ser Pro Ala Asn Thr Leu Thr Leu Glu Glu
580 585 590

Leu Ala Pro Ser Pro Thr Pro Ala Leu Ala Leu Val Pro Pro Ala Pro
595 600 605

Ser Ser Val Lys Ser Ser Ile Ser Lys Ser Val Ser Val Val Asn Val
610 615 620

Thr Ala Ala Arg Lys Phe Gln Gln Glu His Gln Lys Gln Arg Glu Arg
625 630 635 640

Asp Arg Glu Gln Leu Lys Glu Arg Thr Asn Ser Thr Gln Gly Val Ile
645 650 655

Arg Gln Leu Ser Ser Cys Leu Ser Glu Ala Glu Thr Ala Ser Cys Ile
660 665 670

Leu Ser Pro Ala Ser Ser Leu Ser Ala Ser Glu Ala Pro Asp Thr Pro
675 680 685



Asp Pro His Ser Asn Thr Ser Pro Pro Pro Ser Leu His Thr Arg Pro
690 695 700

Ser Val Leu His Arg Thr Leu Thr Ser Thr Leu Arg
705 710 715

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<210> 6 
<211> 250 
<212> DNA 

<213> D. melanogaster 
<400> 6 

atggcagcac cagagacggg caacacgggc tccacaggat ccgctggctc gacaggatcg 60 
ggatcgggat cgggatcggg aagtgggagc tcctcagatc cagcgaatgg acgggaggcc 120 
cgtaaccttg ccgaaaaaca gcgacgggat aagcttaatg ccagcatcca ggagctggct 180 
accatggtac cacatgcagc cgaatcctcc cgtcgcctgg acaaaaccgc cgtccttaga 240 
ttcgccaccc 250 
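
The relationship between SEQ ID NO: 6 and the Met-JHR polypeptide of SEQ ID NO: 11 can be checked with a short translation sketch. The following is only an illustration, not part of the sequence listing: it assumes the standard genetic code and reading frame 1, and the names `translate` and `SEQ6_FIRST_60` are hypothetical. It translates the first 60 nucleotides of SEQ ID NO: 6 as printed above.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered t, c, a, g.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 to one-letter amino acids ('*' marks a stop codon)."""
    dna = "".join(dna.lower().split())      # drop the spaces used in the listing
    usable = len(dna) - len(dna) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of SEQ ID NO: 6, copied from the listing above.
SEQ6_FIRST_60 = (
    "atggcagcac cagagacggg caacacgggc tccacaggat ccgctggctc gacaggatcg"
)

print(translate(SEQ6_FIRST_60))  # prints MAAPETGNTGSTGSAGSTGS
```

The printed residues correspond to Met Ala Ala Pro Glu Thr Gly Asn Thr Gly Ser Thr Gly Ser Ala Gly Ser Thr Gly Ser, which is how SEQ ID NO: 11 begins in this listing.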

<210> 7 
<211> 232 
<212> DNA 
<213> D. erecta

<400> 7 

cagcagacgc gggcaacacg ggcaccacag gatcagctgg gtccacagga tcgggatcgg 60 

gaactgggac gtccgcagat ccagcgaatg gacgggaggc ccgcaatctt gccgagaaac 120 

agcgacggga taagcttaat gccagcatcc aggagctggc taccatggta ccacatgtca 180 

gccgaatcct cccgacgcct ggacaaaacc gccgtcctca gattcgccac cc 232 

<210> 8 

<211> 631 

<212> PRT 

<213> Drosophila 

<400> 8 

Met Asp Glu Ala Asn Ile Gln Asp Lys Glu Arg Phe Ala Ser Arg Glu
1 5 10 15

Asn His Cys Glu Ile Glu Arg Arg Arg Arg Asn Lys Met Thr Ala Tyr
20 25 30

Ile Thr Glu Leu Ser Asp Met Val Pro Thr Cys Ser Ala Leu Ala Arg
35 40 45

Lys Pro Asp Lys Leu Thr Ile Leu Arg Met Ala Val Ala His Met Lys
50 55 60

Ala Leu Arg Gly Thr Gly Asn Thr Ser Ser Asp Gly Thr Tyr Lys Pro
65 70 75 80

Ser Phe Leu Thr Asp Gln Glu Leu Lys His Leu Ile Leu Glu Ala Ala
85 90 95

Asp Gly Phe Leu Phe Val Val Ser Cys Asp Ser Gly Arg Val Ile Tyr
100 105 110

Val Ser Asp Ser Val Thr Pro Val Leu Asn Tyr Thr Gln Ser Asp Trp
115 120 125

Tyr Gly Thr Ser Leu Tyr Glu His Ile His Pro Asp Asp Arg Glu Lys
130 135 140

Ile Arg Glu Gln Leu Ser Thr Gln Glu Ser Gln Asn Ala Gly Arg Ile
145 150 155 160

Leu Asp Leu Lys Ser Gly Thr Val Lys Lys Glu Gly His Gln Ser Ser
165 170 175

Met Arg Leu Ser Met Gly Ala Arg Arg Gly Phe Ile Cys Met Arg Val
180 185 190

Gly Asn Val Asn Pro Glu Ser Met Val Ser Gly His Leu Asn Arg Leu
195 200 205

Lys Gln Arg Asn Ser Leu Gly Pro Ser Arg Asp Gly Thr Asn Tyr Ala
210 215 220

Val Val His Cys Thr Gly Tyr Ile Lys Asn Trp Pro Pro Thr Asp Met
225 230 235 240

Phe Pro Asn Met His Met Glu Arg Asp Val Asp Asp Met Ser Ser His
245 250 255

Cys Cys Leu Val Ala Ile Gly Arg Leu Gln Val Thr Ser Thr Ala Ala
260 265 270

Asn Asp Met Ser Gly Ser Asn Asn Gln Ser Glu Phe Ile Thr Arg His
275 280 285

Ala Met Asp Gly Lys Phe Thr Phe Val Asp Gln Arg Val Leu Asn Ile
290 295 300

Leu Gly Tyr Thr Pro Thr Glu Leu Leu Gly Lys Ile Cys Tyr Asp Phe
305 310 315 320

Phe His Pro Glu Asp Gln Ser His Met Lys Glu Ser Phe Asp Gln Val
325 330 335

Leu Lys Gln Lys Gly Gln Met Phe Ser Leu Leu Tyr Arg Ala Arg Ala
340 345 350

Lys Asn Ser Glu Tyr Tyr Val Trp Leu Arg Thr Gln Ala Tyr Ala Phe
355 360 365

Leu Asn Pro Tyr Thr Asp Glu Val Glu Tyr Ile Val Cys Thr Asn Ser
370 375 380

Ser Gly Lys Thr Met His Gly Ala Pro Leu Asp Ala Ala Ala Ala His
385 390 395 400

Thr Pro Glu Gln Val Gln Gln Gln Gln Gln Gln Gln Gln Glu Gln His
405 410 415

Val Tyr Val Gln Ala Ala Pro Gly Val Asp Tyr Ala Arg Arg Glu Leu
420 425 430

Thr Pro Val Gly Ser Ala Thr Asn Asp Gly Met Tyr Gln Thr His Met
435 440 445

Leu Ala Met Gln Ala Pro Thr Pro Gln Gln Gln Gln Gln Gln Gln Gln
450 455 460

Arg Pro Gly Ser Ala Gln Thr Thr Pro Val Gly Tyr Thr Tyr Asp Thr
465 470 475 480

Thr His Ser Pro Tyr Ser Ala Gly Gly Thr Ser Pro Leu Ala Lys Ile
485 490 495

Pro Lys Ser Gly Thr Ser Pro Thr Pro Val Ala Pro Asn Ser Trp Ala
500 505 510

Ala Leu Arg Pro Gln Gln Gln Gln Gln Gln Gln Gln Pro Val Thr Glu
515 520 525

Gly Tyr Gln Tyr Gln Gln Thr Ser Pro Ala Arg Ser Pro Ser Gly Pro
530 535 540

Thr Tyr Thr Gln Leu Ser Ala Gly Asn Gly Asn Arg Gln Gln Ala Gln
545 550 555 560

Pro Gly Ala Tyr Gln Ala Gly Pro Pro Pro Pro Pro Asn Ala Pro Gly
565 570 575

Met Trp Asp Trp Gln Gln Ala Gly Gly His Pro His Pro Pro His Pro
580 585 590

Thr Ala His Pro His His Pro His Ala His Pro Gly Gly Pro Ala Gly
595 600 605

Ala Gly Gln Pro Gln Gly Gln Gly Val Leu Arg Tyr Ala Ala Asp Val
610 615 620

Gly Ser His Ala Asp His Val
625 630



<210> 9 
<211> 789 
<212> PRT 
<213> human 

<400> 9 

Met Ala Ala Thr Thr Ala Asn Pro Glu Met Thr Ser Asp Val Pro Ser
1 5 10 15

Leu Gly Pro Ala Ile Ala Ser Gly Asn Ser Gly Pro Gly Ile Gln Gly
20 25 30

Gly Gly Ala Ile Val Gln Arg Ala Ile Lys Arg Arg Pro Gly Leu Asp
35 40 45

Phe Asp Asp Asp Gly Glu Gly Asn Ser Lys Phe Leu Arg Cys Asp Asp
50 55 60

Asp Gln Met Ser Asn Asp Lys Glu Arg Phe Ala Arg Ser Asp Asp Glu
65 70 75 80

Gln Ser Ser Ala Asp Lys Glu Arg Leu Ala Arg Glu Asn His Ser Glu
85 90 95

Ile Glu Arg Arg Arg Arg Asn Lys Met Thr Ala Tyr Ile Thr Glu Leu
100 105 110

Ser Asp Met Val Pro Thr Cys Ser Ala Leu Ala Arg Pro Asp Lys
115 120 125

Leu Thr Ile Leu Arg Met Ala Val Ser His Met Lys Ser Leu Arg Gly
130 135 140

Thr Gly Asn Thr Ser Thr Asp Gly Ser Tyr Lys Pro Ser Phe Leu Thr
145 150 155 160

Asp Gln Glu Leu Lys His Leu Ile Leu Glu Ala Ala Asp Gly Phe Leu
165 170 175

Phe Ile Val Ser Cys Glu Thr Gly Arg Val Val Tyr Val Ser Asp Ser
180 185 190

Val Thr Pro Val Leu Asn Gln Pro Gln Ser Glu Trp Phe Gly Ser Thr
195 200 205

Leu Tyr Asp Gln Val His Pro Asp Asp Val Asp Lys Leu Arg Glu Gln
210 215 220

Leu Ser Thr Ser Glu Asn Ala Leu Thr Gly Arg Ile Leu Asp Leu Lys
225 230 235 240

Thr Gly Thr Val Lys Lys Glu Gly Gln Gln Ser Ser Met Arg Met Cys
245 250 255

Met Gly Ser Arg Arg Ser Phe Ile Cys Arg Met Arg Cys Gly Ser Ser
260 265 270

Ser Val Asp Pro Val Ser Val Asn Arg Leu Ser Phe Val Arg Asn Arg
275 280 285

Cys Arg Asn Gly Leu Gly Ser Val Lys Asp Gly Glu Pro His Phe Val
290 295 300

Val Val His Cys Thr Gly Tyr Ile Lys Ala Trp Pro Pro Ala Gly Val
305 310 315 320

Ser Leu Pro Asp Asp Asp Pro Glu Ala Gly Gln Gly Ser Lys Phe Cys
325 330 335

Leu Val Ala Ile Gly Arg Leu Gln Val Thr Ser Ser Pro Asn Cys Thr
340 345 350

Asp Met Ser Asn Val Cys Gln Pro Thr Glu Phe Ile Ser Arg His Asn
355 360 365

Ile Glu Gly Ile Phe Thr Phe Val Asp His Arg Cys Val Ala Thr Val
370 375 380

Gly Tyr Gln Pro Gln Glu Leu Leu Gly Lys Asn Ile Val Glu Phe Cys
385 390 395 400

His Pro Glu Asp Gln Gln Leu Leu Arg Asp Ser Phe Gln Gln Val Val
405 410 415

Lys Leu Lys Gly Gln Val Leu Ser Val Met Phe Arg Phe Arg Ser Lys
420 425 430

Asn Gln Glu Trp Leu Trp Met Arg Thr Ser Ser Phe Thr Phe Gln Asn
435 440 445

Pro Tyr Ser Asp Glu Ile Glu Tyr Ile Ile Cys Thr Asn Thr Asn Val
450 455 460

Lys Asn Ser Ser Gln Glu Pro Arg Pro Thr Leu Ser Asn Thr Ile Gln
465 470 475 480

Arg Pro Gln Leu Gly Pro Thr Ala Asn Leu Pro Leu Glu Met Gly Ser
485 490 495

Gly Gln Leu Ala Pro Arg Gln Gln Gln Gln Gln Thr Glu Leu Asp Met
500 505 510

Val Pro Gly Arg Asp Gly Leu Ala Ser Tyr Asn His Ser Gln Val Val
515 520 525

Gln Pro Val Thr Thr Thr Gly Pro Glu His Ser Lys Pro Leu Glu Lys
530 535 540

Ser Asp Gly Leu Phe Ala Gln Asp Arg Asp Pro Arg Phe Ser Glu Ile
545 550 555 560

Tyr His Asn Ile Asn Ala Asp Gln Ser Lys Gly Ile Ser Ser Ser Thr
565 570 575

Val Pro Ala Thr Gln Gln Leu Phe Ser Gln Gly Asn Thr Phe Pro Pro
580 585 590

Thr Pro Arg Pro Ala Glu Asn Phe Arg Asn Ser Gly Leu Ala Pro Pro
595 600 605

Val Thr Ile Val Gln Pro Ser Ala Ser Ala Gly Gln Met Leu Ala Gln
610 615 620

Ile Ser Arg His Ser Asn Pro Thr Gln Gly Ala Thr Pro Thr Trp Thr
625 630 635 640

Pro Thr Thr Arg Ser Gly Phe Ser Ala Gln Gln Val Ala Thr Gln Ala
645 650 655

Thr Ala Lys Thr Arg Thr Ser Gln Phe Gly Val Gly Ser Phe Gln Thr
660 665 670

Pro Ser Ser Phe Ser Ser Met Ser Leu Pro Gly Ala Pro Thr Ala Ser
675 680 685

Pro Gly Ala Ala Ala Tyr Pro Ser Leu Thr Asn Arg Gly Ser Asn Phe
690 695 700

Ala Pro Glu Thr Gly Gln Thr Ala Gly Gln Phe Gln Thr Arg Thr Ala
705 710 715 720

Glu Gly Val Gly Val Trp Pro Gln Trp Gln Gly Gln Gln Pro His His
725 730 735



Arg Ser Ser Ser Ser Glu Gln His Val Gln Gln Pro Pro Ala Gln Gln
740 745 750

Pro Gly Gln Pro Glu Val Phe Gln Glu Met Leu Ser Met Leu Gly Asp
755 760 765

Gln Ser Asn Ser Tyr Asn Asn Glu Glu Phe Pro Asp Leu Thr Met Phe
770 775 780

Pro Pro Phe Ser Glu
785



<210> 10 
<211> 626 
<212> PRT 
<213> brain and muscle ARNT-like protein a

<400> 10

Met Ala Asp Gln Arg Met Asp Ile Ser Ser Thr Ile Ser Asp Phe Met
1 5 10 15

Ser Pro Gly Pro Thr Asp Leu Leu Ser Ser Ser Leu Gly Thr Ser Gly
20 25 30

Val Asp Cys Asn Arg Lys Arg Lys Gly Ser Ser Thr Asp Tyr Gln Glu
35 40 45

Ser Met Asp Thr Asp Lys Asp Asp Pro His Gly Arg Leu Glu Tyr Thr
50 55 60

Glu His Gln Gly Arg Ile Lys Asn Ala Arg Glu Ala His Ser Gln Ile
65 70 75 80

Glu Lys Arg Arg Arg Asp Lys Met Asn Ser Phe Ile Asp Glu Leu Ala
85 90 95

Ser Leu Val Pro Thr Cys Asn Ala Met Ser Arg Lys Leu Asp Lys Leu
100 105 110

Thr Val Leu Arg Met Ala Val Gln His Met Arg Thr Leu Arg Gly Ala
115 120 125

Thr Asn Pro Tyr Thr Glu Ala Asn Tyr Lys Pro Thr Phe Leu Ser Asp
130 135 140

Asp Glu Leu Lys His Leu Ile Leu Arg Ala Ala Asp Gly Phe Leu Phe
145 150 155 160

Val Val Gly Cys Asp Arg Gly Lys Ile Leu Phe Val Ser Glu Ser Val
165 170 175

Phe Lys Ile Leu Asn Tyr Ser Gln Asn Asp Leu Ile Gly Gln Ser Leu
180 185 190

Phe Asp Tyr Leu His Pro Lys Asp Ile Ala Lys Val Lys Glu Gln Leu
195 200 205

Ser Ser Ser Asp Thr Ala Pro Arg Glu Arg Leu Ile Asp Ala Lys Thr
210 215 220

Gly Leu Pro Val Lys Thr Asp Ile Thr Pro Gly Pro Ser Arg Leu Cys
225 230 235 240

Ser Gly Ala Arg Arg Ser Phe Phe Cys Arg Met Lys Cys Asn Arg Pro
245 250 255

Ser Val Lys Val Glu Asp Lys Asp Phe Pro Ser Thr Cys Ser Lys Lys
260 265 270

Lys Ala Asp Arg Lys Ser Phe Cys Thr Ile His Ser Thr Gly Tyr Leu
275 280 285

Lys Ser Trp Pro Pro Thr Lys Met Gly Leu Asp Glu Asp Asn Glu Pro
290 295 300

Asp Asn Glu Gly Cys Asn Leu Ser Cys Leu Val Ala Ile Gly Arg Leu
305 310 315 320

His Ser His Val Val Pro Gln Pro Val Asn Gly Glu Ile Arg Val Lys
325 330 335

Ser Met Glu Tyr Val Ser Arg His Ala Ile Asp Gly Lys Phe Val Phe
340 345 350

Val Asp Gln Arg Ala Thr Ala Ile Leu Ala Tyr Leu Pro Gln Glu Leu
355 360 365

Leu Gly Thr Ser Cys Tyr Glu Tyr Phe His Gln Asp Asp Ile Gly His
370 375 380

Leu Ala Glu Cys His Arg Gln Val Leu Gln Thr Arg Glu Lys Ile Thr
385 390 395 400

Thr Asn Cys Tyr Lys Phe Lys Ile Lys Asp Gly Ser Phe Ile Thr Leu
405 410 415

Arg Ser Arg Trp Phe Ser Phe Met Asn Pro Trp Thr Lys Glu Val Glu
420 425 430

Tyr Ile Val Ser Thr Asn Thr Val Val Leu Ala Asn Val Leu Glu Gly
435 440 445

Gly Asp Pro Thr Phe Pro Gln Leu Thr Ala Ser Pro His Ser Met Asp
450 455 460

Ser Met Leu Pro Ser Gly Glu Gly Gly Pro Lys Arg Thr His Pro Thr
465 470 475 480

Val Pro Gly Ile Pro Gly Gly Thr Arg Ala Gly Ala Gly Lys Ile Gly
485 490 495

Arg Met Ile Ala Glu Glu Ile Met Glu Ile His Arg Ile Arg Gly Ser
500 505 510

Ser Pro Ser Ser Cys Gly Ser Ser Pro Leu Asn Ile Thr Ser Thr Pro
515 520 525

Pro Pro Asp Ala Ser Ser Pro Gly Gly Lys Lys Ile Leu Asn Gly Gly
530 535 540

Thr Pro Asp Ile Pro Ser Ser Gly Leu Leu Ser Gly Gln Ala Gln Glu
545 550 555 560

Asn Pro Gly Tyr Pro Tyr Ser Asp Ser Ser Ser Ile Leu Gly Glu Asn
565 570 575

Pro His Ile Gly Ile Asp Met Ile Asp Asn Asp Gln Gly Ser Ser Ser
580 585 590

Pro Ser Asn Asp Glu Ala Ala Met Ala Val Ile Met Ser Leu Leu Glu
595 600 605

Ala Asp Ala Gly Leu Gly Gly Pro Val Asp Phe Ser Asp Leu Pro Trp
610 615 620

Pro Leu
625



<210> 11 
<211> 716 
<212> PRT 
<213> Met-JHR 

<400> 11 

Met Ala Ala Pro Glu Thr Gly Asn Thr Gly Ser Thr Gly Ser Ala Gly
1 5 10 15

Ser Thr Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Ser Ser
20 25 30

Asp Pro Ala Asn Gly Arg Glu Ala Arg Asn Leu Ala Glu Lys Gln Arg
35 40 45

Arg Asp Lys Leu Asn Ala Ser Ile Gln Glu Leu Ala Thr Met Val Pro
50 55 60

His Ala Ala Glu Ser Ser Arg Arg Leu Asp Lys Thr Ala Val Leu Arg
65 70 75 80

Phe Ala Thr His Gly Leu Arg Leu Gln Tyr Val Phe Gly Lys Ser Ala
85 90 95

Ser Arg Arg Arg Lys Lys Pro Gly Leu Lys Gly Thr Gly Met Ser Ala
100 105 110

Ser Pro Val Gly Asp Leu Pro Asn Pro Ser Leu His Leu Thr Asp Thr
115 120 125

Leu Met Gln Leu Leu Asp Cys Cys Phe Leu Thr Leu Thr Cys Ser Gly
130 135 140

Gln Ile Val Leu Val Ser Thr Ser Val Glu Gln Leu Leu Gly His Cys
145 150 155 160

Gln Ser Asp Leu Tyr Gly Gln Asn Leu Leu Gln Ile Thr His Pro Asp
165 170 175

Asp Gln Asp Leu Leu Arg Gln Gln Leu Ile Pro Arg Asp Ile Glu Thr
180 185 190

Leu Phe Tyr Gln His Gln His His Gln Gln Gln Gly His Asn Pro Gln
195 200 205



Gln His Ser Thr Ser Thr Ser Ala Ser Ala Ser Gly Ser Asp Leu Glu
210 215 220

Glu Glu Glu Met Glu Thr Glu Glu His Arg Leu Gly Arg Gln Gln Gly
225 230 235 240

Glu Ala Asp Asp Asp Glu Asp His Pro Tyr Asn Arg Arg Thr Pro Ser
245 250 255

Pro Arg Arg Met Ala His Leu Ala Thr Ile Asp Asp Arg Leu Arg Met
260 265 270

Asp Trp Arg Cys Phe Thr Val Arg Leu Ala Arg Ala Ser Thr Arg Ala
275 280 285

Glu Ala Thr Arg His Tyr Glu Arg Val Lys Ile Asp Gly Cys Phe Arg
290 295 300

Arg Ser Asp Ser Ser Leu Thr Gly Gly Ala Ala Ala Asn Tyr Pro Ile
305 310 315 320

Val Ser Gln Leu Ile Arg Arg Ser Arg Asn Asn Asn Met Leu Ala Ala
325 330 335

Ala Ala Ala Val Ala Ala Glu Ala Ala Thr Val Pro Pro Gln His Asp
340 345 350

Ala Ile Ala Gln Ala Ala Leu His Gly Ile Ser Gly Asn Asp Ile Val
355 360 365

Leu Val Ala Met Ala Arg Val Leu Arg Glu Glu Arg Pro Pro Glu Glu
370 375 380

Thr Glu Gly Thr Val Gly Leu Thr Ile Tyr Arg Gln Pro Glu Pro Tyr
385 390 395 400

Gln Leu Glu Tyr His Thr Arg His Leu Ile Asp Gly Ser Ile Ile Asp
405 410 415



Cys Asp Gln Arg Ile Gly Leu Val Ala Gly Tyr Met Lys Asp Glu Val
420 425 430

Arg Asn Leu Ser Pro Phe Cys Phe Met His Leu Asp Asp Val Arg Trp
435 440 445

Val Ile Val Ala Leu Arg Gln Met Tyr Asp Cys Asn Ser Asp Tyr Gly
450 455 460

Glu Ser Cys Tyr Arg Leu Leu Ser Arg Asn Gly Arg Phe Ile Tyr Leu
465 470 475 480

His Thr Lys Gly Phe Leu Glu Val Asp Arg Gly Ser Asn Lys Val His
485 490 495

Ser Phe Leu Cys Val Asn Thr Leu Leu Asp Glu Glu Ala Gly Arg Gln
500 505 510

Lys Val Gln Glu Met Lys Glu Lys Phe Ser Thr Ile Ile Lys Ala Glu
515 520 525

Met Pro Thr Gln Ser Ser Ser Pro Asp Leu Pro Ala Ser Gln Ala Pro
530 535 540

Gln Gln Leu Glu Arg Ile Val Leu Tyr Leu Ile Glu Asn Leu Gln Lys
545 550 555 560

Ser Val Asp Ser Ala Glu Thr Val Gly Gly Gln Gly Met Glu Ser Leu
565 570 575

Met Asp Asp Gly Tyr Ser Ser Pro Ala Asn Thr Leu Thr Leu Glu Glu
580 585 590

Leu Ala Pro Ser Pro Thr Pro Ala Leu Ala Leu Val Pro Pro Ala Pro
595 600 605

Ser Ser Val Lys Ser Ser Ile Ser Lys Ser Val Ser Val Val Asn Val
610 615 620

Thr Ala Ala Arg Lys Phe Gln Gln Glu His Gln Lys Gln Arg Glu Arg
625 630 635 640

Asp Arg Glu Gln Leu Lys Glu Arg Thr Asn Ser Thr Gln Gly Val Ile
645 650 655

Arg Gln Leu Ser Ser Cys Leu Ser Glu Ala Glu Thr Ala Ser Cys Ile
660 665 670

Leu Ser Pro Ala Ser Ser Leu Ser Ala Ser Glu Ala Pro Asp Thr Pro
675 680 685

Asp Pro His Ser Asn Thr Ser Pro Pro Pro Ser Leu His Thr Arg Pro
690 695 700

Ser Val Leu His Arg Thr Leu Thr Ser Thr Leu Arg
705 710 715



<210> 12 
<211> 808 
<212> PRT 
<213> human 

<400> 12 

Met Asn Ser Ser Ser Ala Asn Ile Thr Tyr Ala Ser Arg Lys Arg Arg
1 5 10 15

Lys Pro Val Gln Lys Thr Val Lys Pro Ile Pro Ala Glu Gly Ile Lys
20 25 30

Ser Asn Pro Ser Lys Arg His Arg Asp Arg Leu Asn Thr Glu Leu Asp
35 40 45

Arg Leu Ala Ser Leu Leu Pro Phe Pro Gln Asp Val Ile Asn Lys Leu
50 55 60

Asp Lys Leu Ser Val Leu Arg Leu Ser Val Ser Tyr Leu Arg Ala Lys
65 70 75 80

Ser Phe Phe Asp Val Ala Leu Lys Ser Ser Pro Thr Glu Arg Asn Gly
85 90 95

Gly Gln Asp Asn Cys Arg Ala Ala Asn Phe Arg Glu Leu Asn Leu
100 105 110



Gln Glu Gly Glu Phe Leu Leu Gln Ala Leu Asn Gly Phe Val Leu Val
115 120 125

Val Thr Thr Asp Ala Leu Val Phe Tyr Ala Ser Ser Thr Ile Gln Asp
130 135 140

Tyr Leu Gly Phe Gln Gln Ser Asp Val Ile His Gln Ser Val Tyr Glu
145 150 155 160

Leu Ile His Thr Glu Asp Arg Ala Glu Phe Gln Arg Gln Leu His Trp
165 170 175

Ala Leu Asn Pro Ser Gln Cys Thr Glu Ser Gly Gln Gly Ile Glu Glu
180 185 190

Ala Thr Gly Leu Pro Gln Thr Val Val Cys Tyr Asn Pro Asp Gln Ile
195 200 205

Pro Pro Glu Asn Ser Pro Leu Met Glu Arg Cys Phe Ile Cys Arg Leu
210 215 220

Arg Cys Leu Leu Asp Asn Ser Ser Gly Phe Leu Ala Met Asn Phe Gln
225 230 235 240

Gly Lys Leu Lys Tyr Leu His Gly Gln Lys Lys Lys Gly Lys Asp Gly
245 250 255

Ser Ile Leu Pro Pro Gln Leu Ala Leu Phe Ala Ile Ala Thr Pro Leu
260 265 270

Gln Pro Pro Ser Ile Leu Glu Ile Arg Thr Lys Asn Phe Ile Phe Arg
275 280 285

Thr Lys His Lys Leu Asp Phe Thr Pro Ile Gly Cys Asp Ala Lys Gly
290 295 300

Arg Ile Val Leu Gly Tyr Thr Glu Ala Glu Leu Cys Thr Arg Gly Ser
305 310 315 320

Gly Tyr Gln Phe Ile His Ala Ala Asp Met Leu Tyr Cys Ala Glu Ser
325 330 335

His Ile Arg Met Ile Lys Thr Gly Glu Ser Gly Met Ile Val Phe Arg
340 345 350

Leu Leu Thr Lys Asn Asn Arg Trp Thr Trp Val Gln Ser Asn Ala Arg
355 360 365

Leu Leu Tyr Lys Asn Gly Arg Pro Asp Tyr Ile Ile Val Thr Gln Arg
370 375 380

Pro Leu Thr Asp Glu Glu Gly Thr Glu His Leu Arg Lys Arg Asn Thr
385 390 395 400

Lys Leu Pro Phe Met Phe Thr Thr Gly Glu Ala Val Leu Tyr Glu Ala
405 410 415



Thr Asn Pro Phe Pro Ala Ile Met Asp Pro Leu Pro Leu Arg Thr Lys
420 425 430

Asn Gly Thr Ser Gly Lys Asp Ser Ala Thr Thr Ser Thr Leu Ser Lys
435 440 445

Asp Ser Leu Asn Pro Ser Ser Leu Leu Ala Ala Met Met Gln Gln Asp
450 455 460

Glu Ser Ile Tyr Leu Tyr Pro Ala Ser Ser Thr Ser Ser Thr Ala Pro
465 470 475 480

Phe Glu Asn Asn Phe Phe Asn Glu Ser Met Asn Glu Cys Arg Asn Trp
485 490 495

Gln Asp Asn Thr Ala Pro Met Gly Asn Asp Thr Ile Leu Lys His Glu
500 505 510

Gln Ile Asp Gln Pro Gln Asp Val Asn Ser Phe Ala Gly Gly His Pro
515 520 525

Gly Leu Phe Gln Asp Ser Lys Asn Ser Asp Leu Tyr Ser Ile Met Lys
530 535 540

Asn Leu Gly Ile Asp Phe Glu Asp Ile Arg His Met Gln Asn Glu Lys
545 550 555 560

Phe Phe Arg Asn Asp Phe Ser Gly Glu Val Asp Phe Arg Asp Ile Asp
565 570 575

Leu Thr Asp Glu Ile Leu Thr Tyr Val Gln Asp Ser Leu Ser Lys Ser
580 585 590

Pro Phe Ile Pro Ser Asp Tyr Gln Gln Gln Gln Ser Leu Ala Leu Asn
595 600 605

Ser Ser Cys Met Val Gln Glu His Leu His Leu Glu Gln Gln Gln Gln
610 615 620

His His Gln Lys Gln Val Val Val Glu Pro Gln Gln Gln Leu Cys Gln
625 630 635 640

Lys Met Lys His Met Gln Val Asn Gly Met Phe Glu Asn Trp Asn Ser
645 650 655

Asn Gln Phe Val Pro Phe Asn Cys Pro Gln Gln Asp Pro Gln Gln Tyr
660 665 670

Asn Val Phe Thr Asp Leu His Gly Ile Ser Gln Glu Phe Pro Tyr Lys
675 680 685

Ser Glu Met Asp Ser Met Pro Tyr Thr Gln Asn Phe Ile Ser Cys Asn
690 695 700

Gln Pro Val Leu Pro Gln His Ser Lys Cys Thr Glu Leu Asp Tyr Pro
705 710 715 720

Met Gly Ser Phe Glu Pro Ser Pro Tyr Pro Thr Thr Ser Ser Leu Glu
725 730 735

Asp Phe Val Thr Cys Leu Gln Leu Pro Glu Asn Gln Lys His Gly Leu
740 745 750

Asn Pro Gln Ser Ala Ile Ile Thr Pro Gln Thr Cys Tyr Ala Gly Ala
755 760 765

Val Ser Met Tyr Gln Cys Gln Pro Glu Pro Gln His Thr His Val Gly
770 775 780

Gln Met Gln Tyr Asn Pro Val Leu Pro Gly Gln Gln Ala Phe Leu Asn
785 790 795 800

Lys Phe Gln Asn Gly Val Phe Lys
805






<210> 13 
<211> 10 
<212> DNA 

<213> Met-JHR cDNA 
<400> 13 

caaaatggca 10 

<210> 14 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: motif that
interacts with co-activators of steroid receptors

<220>

<223> Xaa may be any amino acid
<400> 14 

Leu Xaa Xaa Leu Leu 
1 5 
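
The Leu Xaa Xaa Leu Leu motif of SEQ ID NO: 14 is simple to search for once a sequence is written in one-letter code. The sketch below is illustrative only: the helper name `find_lxxll` is hypothetical, and the test fragment is residues 113 to 144 of SEQ ID NO: 11 converted by hand from the three-letter listing, so it inherits any transcription error in that listing. A lookahead pattern is used so that overlapping matches would also be reported.

```python
import re

# "L..LL" is Leu-Xaa-Xaa-Leu-Leu; the lookahead lets overlapping hits be found.
LXXLL = re.compile(r"(?=(L..LL))")


def find_lxxll(seq: str, first_residue: int = 1):
    """Return (residue_number, matched_text) for every LXXLL occurrence.

    first_residue is the sequence position of seq[0], so that fragments
    can be reported in the numbering of the full-length polypeptide.
    """
    return [(m.start() + first_residue, m.group(1))
            for m in LXXLL.finditer(seq.upper())]


# Residues 113-144 of SEQ ID NO: 11 in one-letter code (hand-converted).
MET_JHR_113_144 = "SPVGDLPNPSLHLTDTLMQLLDCCFLTLTCSG"

print(find_lxxll(MET_JHR_113_144, first_residue=113))  # prints [(129, 'LMQLL')]
```

A pattern match of this kind only locates the residues; it says nothing about whether the site is functional.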
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and the DNA binding region of a second polypeptide, (2) 
DNA encoding a bHLH-PAS polypeptide and the activation 
domain of said second polypeptide, and (3) a reporter 
gene under the control of a minimal promoter driven by 
the response element for said second polypeptide; 

(b) incubating a test compound with said host cell; 

and 

(c) detecting the binding of the test compound with
said bHLH-PAS polypeptide by monitoring expression of the
reporter gene.

37. A method according to claim 32, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

38. A method according to claim 33, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

39. A method according to claim 34, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

40. A method according to claim 35, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

41. A method according to claim 36, wherein said 
host cell is selected from the group of an insect cell, 
a yeast cell, and a mammalian cell. 

42. An isolated polynucleotide which comprises the 
sequence of SEQ ID NO: 6. 

43. An isolated polynucleotide which comprises the 
sequence of SEQ ID NO: 7. 
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